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Abstract 

This paper is concerned with the application of nonbinary low-density parity-check (NB-LDPC) 
codes to binary input inter-symbol interference (ISI) channels. Two low-complexity joint detection/decoding 
algorithms are proposed. One is referred to as max-log-MAP/X-EMS algorithm, which is implemented 
by exchanging soft messages between the max-log-MAP detector and the extended min-sum (EMS) 
decoder. The max-log-MAP/X-EMS algorithm is applicable to general NB-LDPC codes. The other 
one, referred to as Viterbi/GMLGD algorithm, is designed in particular for majority-logic decodable 
NB-LDPC codes. The Viterbi/GMLGD algorithm works in an iterative manner by exchanging hard- 
decisions between the Viterbi detector and the generalized majority-logic decoder (GMLGD). As a by- 
product, a variant of the original EMS algorithm is proposed, which is referred to as /i-EMS algorithm. 
In the /i-EMS algorithm, the messages are truncated according to an adaptive threshold, resulting in a 
more efficient algorithm. Simulations results show that the max-log-MAP/X-EMS algorithm performs 
as well as the traditional iterative detection/decoding algorithm based on the BCJR algorithm and the 
QSPA, but with lower complexity. The complexity can be further reduced for majority-logic decodable 
NB-LDPC codes by executing the Viterbi/GMLGD algorithm with a performance degradation within 
one dB. Simulation results also confirm that the /t-EMS algorithm requires lower computational loads 
than the EMS algorithm with a fixed threshold. These algorithms provide good candidates for trade-offs 
between performance and complexity. 
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I. Introduction 

Nonbiliary low-density parity-check (NB-LDPC) codes were first introduced by Gallager based 
on modulo arithmetic [1]. In [2], Davey and Mackay presented a class of NB-LDPC codes 
defined over the finite field ¥ q . They also introduced a Q-ary sum-product algorithm (QSPA) for 
decoding NB-LDPC codes. NB-LDPC codes outperform their binary counterparts when used 
in the channels with burst errors or combined with higher-order modulations. However, the 
applications of NB-LDPC codes are limited due to their high decoding complexity. To reduce 
the decoding complexity, a more efficient QSPA based on fast Fourier transform (FFT-QSPA) 
was proposed in [3] [4]. To further reduce the decoding complexity, extended min-sum (EMS) 
algorithms were proposed in [5] [6]. The EMS algorithm in [6] was re-described in [7] as a 
reduced-search trellis algorithm, called M-EMS algorithm. Also presented in [7] are two variants 
of the M-EMS algorithm, called T-EMS algorithm and D-EMS algorithm, respectively 1 . For 
majority-logic decodable NB-LDPC codes, different low-complexity decoding algorithms have 
been proposed [8] [9]. Different construction methods of NB-LDPC codes have been proposed 
in the literature, see [10-12] and the references therein. 

The inter-symbol interference (ISI) is a common phenomenon in high-density digital recording 
systems and wireless communication systems [13]. Different equalizers have been proposed 
in the literature [14-23]. Since the invention of the turbo codes [24], the rediscovery of the 
LDPC codes [1], and most importantly, the success of the applications of turbo principle to 
equalizations [19] [25] [26], many works have been done to apply turbo principles to coded ISI 
channels [27-33], where binary convolutional codes, turbo codes or LDPC codes are usually 
considered as the "outer codes" of the serial concatenated system. However, few works are 
available for the NB-LDPC coded ISI channels. An example is given in Appendix showing that 
nonbinary may be more suitable to combat inter-symbol interference. 

In this paper, we investigate reduced complexity detection/decoding algorithms for NB-LDPC 
coded ISI channels. Two low-complexity joint detection/decoding algorithms are proposed. For 
general NB-LDPC coded ISI channels, we propose the max-log-MAP/X-EMS algorithm, in 
which the detector and the decoder are implemented with the max-log-MAP algorithm and the 
X-EMS algorithm, respectively. In this algorithm, the detector takes as input the soft extrinsic 

'All these variants are referred to as X-EMS algorithms in [7], 
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messages from the decoder and delivers as output the soft extrinsic messages of each coded 
symbol; the decoder takes as input the messages from the detector and feeds back to the detector 
the soft extrinsic messages of each coded symbol. Simulations results show that the max-log- 
MAP/X-EMS algorithm performs as well as the traditional iterative detection/decoding algorithm 
based on the BCJR algorithm and the QSPA, but with reduced complexity. Meanwhile, a variant 
of the original T-EMS algorithm is proposed, which is referred to as /x-EMS. The threshold of 
the /i-EMS algorithm is adaptive and hence can be matched to channel observation. Simulation 
results show that the proposed yU-EMS algorithm is more effective than the original T-EMS 
algorithm when applied to coded ISI channels. For majority-logic decodable NB-LDPC coded ISI 
channels, a further complexity-reduced joint detection/decoding algorithm is proposed, referred 
to as Viterbi/GMLGD algorithm, which is based on the Viterbi algorithm and the generalized 
majority-logic decoding (GMLGD) algorithm [9]. In the Viterbi/GMLGD algorithm, the Viterbi 
detector takes as input the messages from the decoder and delivers as output the hard-decision 
sequence; the decoder takes as input the hard-decision sequence from the detector and feeds back 
to the detector the estimated messages of each coded symbol. Simulations results show that the 
Viterbi/GMLGD algorithm suffers from a performance degradation within one dB compared with 
BCJPv/QSPA. These algorithms provide good candidates for trade-offs between performance and 
complexity. 

The organization of this paper is as follows. Section II introduces the considered system model. 
Also given in Section II is the quantization algorithm to initialize the detector. The max-log- 
MAP/X-EMS algorithms and the Viterbi/GMLGD algorithm are described in Section III and 
Section IV, respectively. Complexity comparisons and simulation results are given in Section V. 
Section VI concludes this paper. 

II. NB-LDPC Coded ISI Channel 

A. NB-LDPC Codes 

Let ¥ q be the finite filed with q = 2 rn elements. A NB-LDPC code C q [N,K] is defined as 
the null space of a sparse nonbinary parity-check matrix H = [h i: j] M xN, where h iy j G ¥ q . A 
vector v— (v , v 1: v 2 , • • -Vq-i) is a codeword if and only if Ht> T = 0. For convenience, 

we define the two index sets as follows: 
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Fig. 1: System model of a NB-LDPC coded ISI channel. 



Ni = {j : < j < N - 1, hij ^ 0} for each row i of H 

and 

M.j = {i : < i < M — 1, /iy 7^ 0} for each column j of H. 

B. ISI Channel Model 

The ISI channel of order L is characterized by a polynomial 

f(D) = f + fiD + f 2 D 2 + --- + f L D L , (1) 

where the coefficients fi E E. Let x t E X = { — 1,+1} be the channel input at time t. The 
output signal y t at time t is statistically determined by 

L 
1=0 

where w t is a sample from a white Gaussian noise with two-sided power spectral density a 2 = 

No/2. 

C. The System Model 

The system model of a NB-LPDC coded ISI channel is shown in Fig. 1. 

Encoding: The sequence u = (u ,Ui,u 2 , ■ ■ ■ , E ¥^ to be transmitted is encoded by 

the NB-LDPC encoder into a codeword v = (v , v i, v 2 , ■ ■ ■ , v^-i) € F^. 

Modulation: The codeword v is interpreted as a binary sequence c = (c , Ci, • • • c n ) with 

= miV by replacing each component Vj with its binary representation in ¥ q . The binary 
sequence Cj is then mapped into a bipolar sequence x = (x ,Xi, ■ ■ ■ , sc n _i) with x 4 — 2v t — 1 
and transmitted over the ISI channel. 

Detection/Decoding: Upon receiving y, the receiver attempts to recover the transmitted data 
u. This can be done by following the well-known turbo principle [24] and executing an iterative 




message processing/passing algorithm [34] over the normal graph [35] shown in Fig. 2. The 
normal graph has four types of nodes (constraints): M check nodes (C-node), N variable 
nodes (V-node), iV trellis nodes (T-node) and 5 H-node, where 5 denotes the number of nonzero 
elements in the parity-check matrix H. The main ingredients of the message processing/passing 
algorithm include 

• Detector: The commonly used detection algorithms are the Viterbi algorithm [15], the BCJR 
algorithm [16] [36] and the max-log-MAP algorithm [37] [38]. 

• Decoder. The commonly used decoding algorithm are the QSPA (or FFT-QSPA) [2-4] 
and the X-EMS algorithms [5] [7]. For majority-logic decodable NB-LDPC codes [39], the 
decoder can also be implemented with the GMLGD algorithm [9]. 

Assume that the detector and the decoder are implemented with algorithms A and B, respec- 
tively. We define the following two different schedules 
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• A — > B: The detector executes the algorithm A only once, then the decoder performs the 
decoding algorithm B. 

• A 4-)- B: The detector and the decoder work in an iterative manner by exchanging either 
soft messages or hard messages between A and B. 

In this paper, a reduced complexity detection/decoding algorithm based on the max-log- 
MAP algorithm and X-EMS algorithms is proposed (max-log-MAP— t>X-EMS or max-log- 
MAP^X-EMS). For majority-logic decodable NB-LDPC codes, we propose a further reduced 
complexity detection/decoding algorithm based on the Viterbi algorithm and the GMLGD al- 
gorithm (Viterbi<H>GMLGD). The conventional dectection/decoding algorithms based on the 
BCJR algorithm and the QSPA, denoted by BCJR^QSPA and BCJR^QSPA, will be taken 
as benchmarks for comparison. 

D. Sectionalized Trellis 

It has been shown that the ISI channel can be represented by a time-invariant trellis [14]. 
At each stage, the trellis has 2 L states. Emitting from each state, there are two branches, 
corresponding to binary inputs and 1, respectively. For convenience, this trellis is referred 
to as the original trellis. When an iterative joint decection/decoding algorithm is adopted, we 
need to exchange messages between the detector and the decoder. The processing of the decoder 
is symbol-oriented, while the original trellis is bit-oriented. So it is necessary to transform from 
symbol-based messages to bit-based messages and vice versa, which requires additional compu- 
tational loads and may cause performance degradations. A way to avoid such a transformation is 
to work on a sectionalized trellis [40] directly, which can be obtained from the original trellis. 
For example, the original trellis and the sectionalized trellis matched to F 16 on the the dicode 
channel f(D) = 1 — D are illustrated in the Fig. 3. 

• The sectionalized trellis has N section, which are indexed by < j < N — 1. The j-th 
section corresponds to the j-th coded symbol vj. 

• At each stage, there are 2 L states, which are simply indexed by < s < 2 L — 1. Each 
state at the j-th stage corresponds to a bipolar sequence of the length L, that is Sj -H- 
(x(j-i)L, ■ ■ ■ , XjL-i), where x t is the input to the channel at time t, and x t , t < is assumed 
to be known at the receiver. The collection of the states at the j-th stage is denoted by Sj. 
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Fig. 3: Trellis representations of the dicode channel, (a) The original trellis, (b) The sectionalized 
trellis matched to Fi 6 . 



• Emitting from each state, there are 2 m branches. Each branch in the j-th section is specified 
by a 4-tuple b = (sj, Vj, Zj, Sj+i), where Vj E ¥ q is the j-th possible coded symbol that 
takes the state from Sj into Sj + i and results in the noiseless output vector Zj of length m. 
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In other words, each branch in the sectionalized trellis corresponds to a path of length m 
in the original trellis. The collection of branches in the j-th section is denoted by Bj, we 
have \Bj\ = 2 L+m . 

E. Possibility function Calculation 

Like most reduced complexity algorithms, we use log-domain messages in the proposed 
algorithm. Let Z be a discrete random variable taking on values over Z. We use P z (z),z E Z 
to denote its probability mass function (pmf). Its possibility function is defined as L z (z) = 
[do log Pz{z) + ai], z E Z, where [x] represents the integer closest to x E K and a > 0, a\ E R 
are two constants. Obviously, we can confine the range of L z (z) to be [0,2 P — 1] by properly 
choosing parameters a and a±. In this case L z (z) is also referred to as a p-bit possibility 
function. The possibility function can be considered as an integer measure on the possibility of 
the occurrence of each value z E Z. Let X denote the variable on the edge connecting the node 
Ai and Ai in Fig. 2. We will use L^ 1 ^^ to denote the message from Ai to A2. 

To each branch in the j-th section of the sectionalized trellis, we assign an integer [Zj), 
where Zj is the associated noiseless output. The possibility function L^ Tj \zj) can be determined 
using the following algorithm. 

Algorithm 1: Given the received vector y,2 p — l and the maximum allowable squared Euclidean 
distance d max for quantization. For j = 0,1, ■ ■ ■ , N — 1, 

Step 1 : Calculate d(z t ) = \\yj — Zj\\ 2 , which is the squared Euclidean distance between yj 
and zy, 

Step 2 : If d(z t ) > d max , set d(z t ) = d 

max t 

Step 3 : For each noiseless output zj, calculate 



It can be easily checked that (zj) is a p-bit possibility function. For the least pos- 

sible element Zj, we have L z ^ T ^{zj) = 0; while for the most possible element Zj, we have 
Lz~* Tj \ z j) — 2 P —1. Notice that the variance of the noise is not required to determine L^~* Tj \zj). 

Remarks: It should be pointed out that the maximum allowable Euclidean distance d rnax is 
time-invariant which ensures that a in the possibility function is time independent. In this paper, 




(3) 
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the max-log-MAP algorithm and the Viterbi algorithm are implemented over the sectionalized 
trellis with the p-bit possibility function as branch metrics. As a result, the detectors require only 
integer operations. 

III. The Max-log-MAP^X-EMS Algorithm 

A. T-node: Max-log-MAP Detection 

To each branch bj = (sj,Vj,Zj, s^+i) and < j < N, we assign an integer metric 

L J (b j ) = ^\z j ) + L^ ) (v j ), (4) 

where L^ 3 ^ 7 ^^) are initialized as zeros and L^ Tj \zj) is determined by Algorithm 1. Then 
we can execute the max-log-MAP algorithm to obtain an extrinsic possibility vector l}y^ V:i \ 
for < j < N - 1. 

Remark: It should be pointed out that the possibility vector L^~ j ~* Vj \vj) is normalized such 
that the reliability of the least possible element is equal to 0. 

B. V-node: Computing the Extrinsic Message to H-node 

Given Xij — x , the event of an V-node Vj being satisfied is equivalent to the event {Vj = 

x} f\ k¥li {X ki = x}. We have 

where L^^ Vl \x) is the message from the max- log-MAP detector and L { ^p V] \x) are initial- 
ized as zeros for x e ¥ q . 

C. H-node: Message Permutation 

Given = y, the event of an H-node Hij being satisfied is equivalent to the event {X^ = 
hij l y}- We have 

L%r Ci \y) = L ( ^\h^y), y G F ff . (6) 
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D. C-node: Computing the Extrinsic Message to H-node 

Message-truncation rules: Given the message L Y %3 , we can partition the finite field F„ 
into J 7 and ¥ q — J 7 . Three different message-truncation rules have been proposed in [7]. That 
are 



jr M = [ y e W q \L^ 3 \y) is one of the M largest components of Ly£ j '}, 

T T = {yeW q \L%;^\y)>T}, 

and 

T D = {y e ¥ q \L max - Lf:^ Ci \y) < D}, 

(7~t ' ' >C ) 

where L max denotes the largest component of L Y . 11 and D is a designated parameter. In this 
paper, we give a new truncation rule 

^ = {ye¥ q \L^\y)>^}, 

where fx is determined by 

n = -T, I %?" Ct) (v)- c > (7) 

where c is a constant to be designated. That is, /j, is equal to the mean of the possibility vector 

(*U . . ■ ) 

L Y . 13 with an offset of c. The resultant EMS algorithm is referred to as /i-EMS here. 

Given a truncation rule, the possibility vector Ly* 1 from the C-node Cj to the H-node 
Hij can be calculated by a reduced trellis search algorithm. See [7] for details. 

Remark: The truncation rule is simpler than the truncation rule Fm, since no ordering is 
required. The truncation rule is similar to Tt except that the threshold of is data-dependent 
and hence can be matched to data and iterations. 
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E. H-node: Message Permutation 

Given X {j = x, the event of an H-node being satisfied is equivalent to the event {Y^ = 
hijx}. Then 

L^ v '\x) = L^\h l3 x), x E ¥ q . (8) 



F. V-node: Making Decisions and Computing the Extrinsic Message to T-node 
For the V-node Vj, < j < N — 1, calculate the message 

L Vj (x) = L^^ix) + L { ^ Vj \x) (9) 
and make decisions according to 

Vj = argmaxLy (x). (10) 

J xe¥ q 3 

If Hv 1 = 0, output v as the estimated codeword. If Hv ^ 0, calculate the message L v 3 3 
from V-node Vj to T-node Tj as 

L^^ ) (x)=L Vj (x)-L^ V3) (x), (11) 

for x e F q . 



G. Summary of The Max-log-MAP^X-EMS Algorithm 

• Initialization: Given y and a truncation rule J 7 , set a maximum iteration number C and an 
iteration variable / = 0. For all Vj and x E F q , set L { ^ Tj) (x) = 0, L { ^ Vj \x) = 0. 

• Iteration: while I < C : 

1) Detection at T-node: Executing the max-log-MAP algorithm with the branch metrics 
as defined in (4) to obtain the possibility vector l}y 3 ^ V ^ . 

2) Messages processing at V-node: for all V-nodes, calculate j}^ 3 ~^ n%3 ^ according to (5). 

3) Messages permutation at H-node: for all H-nodes, permute the messages Ly 13 
according to (6). 

4) Messages processing at C-node: for all C-nodes, calculate the messages Ly* 13 
according to the truncation rule J 7 . 
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5) Messages permutation at H-node: for all H-nodes, permute the messages L^ 3 ^ V ^ 
according to (8). 

6) Messages processing at V-node: for all V-nodes, calculate the messages L v . and find 
Vj. If HiF = 0, output v and exit the iteration; otherwise, calculate the messages 

7) Increment I by one. 

• Failure: If I — C, report a decoding failure. 
Remark: Note that the proposed algorithm requires only integer operations and finite field 
operations. 

IV. The Viterbi^GMLGD Algorithm 

For majority-logic decodable NB-LDPC coded ISI channels, we propose a further complexity- 
reduced joint detection/decoding algorithm based on the Viterbi algorithm and the GMLGD 
algorithm. The parity-check matrix of a majority-logic decodable NB-LDPC code [39] has the 
property that no two rows (or two columns) have more than one position where they both have 
nonzero-components. This guarantees that the Tanner graph of the code is free of cycle of length 
4 and hence has girth of at least 6. In practice, majority-logic decodable NB-LDPC codes with 
redundant rows [41] are preferred. 

A. T-node: Viterbi Detection 

To each branch bj = (sj, Vj, Zj, Sj+i) and < j < N, we assign an integer metric 

L j (b j ) = L%> T '\z j ) + I%^ 7i \v j ), (12) 

where L§? 3 ~* Tj \vj) are initialized as zeros and L^ Tj \zj) is determined by Algorithm 1. Then 
we can run the Viterbi algorithm through the sectionalized trellis to find a path b , b±, ■ ■ ■ , b^-i 
such that the path metric Z)o<?<jv-i Aj'(fy) ^ s maximized. The associated input sequence v is 
then passed to the variable nodes as the hard decisions. 

B. V-node: Syndrome Computation 

After receiving the hard-decision vector v from the T-node, we may calculate the syndrome 

s = vH T = (s , ,s m _i). (13) 
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If s = 0, output v as the decoding result; otherwise, the variable nodes send the hard decision 
vector v together with the syndrome vector s to the check nodes. 

C. C-node: Extrinsic Estimation 

The i-th check node sends back an extrinsic estimate to the j-th variable node, which is 
denoted by a,^ and can be determined by 

<Ti^j = -h~j( hij'Vj') = -Kjsj - iij, (14) 
where % G Mj and all the operations are executed in ¥ q . 

D. V-node: Possibility Function Updates 

Intuitively, for each variable node Vj, the occurrence of each a G ¥ q in the received messages 
{/Ji^j,i G Mj} from check nodes reflects its possibility. Therefore these votes can be used to 
update the possibility function by increasing L^ 3 ~^ Tj \vj), Vj G ¥ q , according to the following 
rule: 

1^ >Tj) ^ -j) <- I-v/ ''^V, -,) + 1, (15) 

for all i e Mj. In words, for a given a G ¥ g , L^ 3 ~^ Tj \a) is a counter that accumulates, up to 
and inclusive of the current iteration, all the occurrences of Vj = Vj in the extrinsic messages 
sent back from the adjacent check nodes. 

E. Summary of The Viterbi^GMLGD Algorithm 

1) Initialization: Given y, calculate the p-bit possibility functions according to Algorithm 1. 
Select a maximum iteration number C > and set / = 0. For all Vj and Vj G ¥ q , set 

4T ri, M=o. 

2) Iteration: While / < £: 

a) Detection at T-node: determines the hard decision sequence v by executing the Viterbi 
algorithm with branch metrics as defined in (12). 

b) Syndrome computation at V-node: compute the syndrome s according to (13). If 
s — 0, output v and exit the iteration; otherwise, send s and v to the C-nodes. 
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c) Extrinsic estimation at C-node: compute a^j according to (14) and send them to the 
V-nodes. 

d) Possibility function update at V-node: update the possibility functions Ly^ T ^ ac- 
cording to (15). 

e) Increment I by one; 

3) Failure Report: If I = C, report a decoding failure. 

V. Complexity Analysis and The Numerical Results 

A. Complexity Analysis 

The computational complexities per iteration of the Viterbi algorithm, the BCJR algorithm, the 
max-log-MAP algorithm, the GMLGD algorithm and the QSPA algorithm are shown in Table I, 
where 5 denotes the number of non-zero elements in H. However, the complexity of the X-EMS 
algorithm varies from iteration to iteration. 

Apparently, for each iteration, the max-log-MAP^X-EMS algorithm and the Viterbi^GMLGD 
algorithm require less operations than BCJR^QSPA. However, they may require more iterations 
to converge. Therefore, for a fair comparison, we take 

total number of operations of a given algorithm 

(16) 

total number of operations of the BCJR<H>QSPA algorithm 

as the complexity measurement. Note that the statistical mean (averaging over frames) of the 
total number of operations involved in all iterations for decoding one frame is used in (16). Also 
note that the ratio in (16) only give a rough comparison, as different algorithms require different 
operations. 

B. Numerical Results 

Let X s and X b denote the parameters in the truncation rule X for state metrics and branch 
metrics, respectively. 

Example 1: Consider the dicode channel with characteristic polynomial f(D) = 1 — D. The 
simulated code is the 32-ary LDPC code C 32 [961, 765] of rate 0.79, which is constructed by the 
properties of finite fields [11]. The corresponding parity-check matrix has row weight 30 and 
column weights 10 and 11. The squared Euclidean distances in Algorithm 1 are quantized with 
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TABLE I: Computational Complexities of Different Algorithms Required Per Iteration. 
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p = 9 and d max = 80. All the algorithms are carried out with maximum iteration C = 50. The 
parameters of the max-log-MAP <H>X-EMS algorithms are listed in the following: 

1) for the /i-EMS algorithm, fi is calculated by (7) with c = 1; for the TJ-EMS algorithm, 
D s = 50, D b = 40; for the T-EMS algorithm, T s = 20, T b = 10; for M-EMS algorithm, 

M = 16; 

2) the scaling factors of the M-EMS algorithm, the T-EMS algorithm, the T>-EMS algorithm 
and the /i-EMS algorithm are 0.4, 0.4, 0.3 and 0.4, respectively. 

The simulation results are shown in the Fig. 4. It can be seen that at bit error rate (BER) 10~ 5 

a) the max-log-MAP<H>X-EMS algorithms perform as well as the BCJR^QSPA; 

b) the max-log-MAP— >X-EMS algorithms have almost the same performances and suffer 
from performance degradations about 0.1 dB compared with BCJR^QSPA; 

c) the Viterbi<R>GMLGD algorithm suffers from performance degradations about 0.6 dB 
compared with BCJR^QSPA. 

The complexity ratios of different detection/decoding algorithms are shown in Fig. 5. It can be 
seen that, at BER=10~ 5 , the Viterbi <H> GMLGD algorithm is the simplest one with complexity 
ratio about 0.05, the max-log-MAP^yU-EMS and the max-log-MAP <H>D-EMS have almost the 
same complexity with complexity ratio 0.5. We also notice that both max-log-MAP<H>T-EMS 
algorithm and BCJR— )>QSPA are more complex than BCJR<H>QSPA. This is because the com- 
plexity reduction per iteration of these two algorithms is not enough to counteract the complexity 
increase caused by the extra iterations 2 . In particular, T-EMS algorithm with a fixed performance- 



2 Actually, all other algorithms require more iterations than BCJRoQSPA to converge in our simulations. 



16 




E b /N (dB) 



Fig. 4: Error performances of different detection/decoding algorithms for decoding the 
C 32 [961,765] coded dicode channel. 



guaranteed threshold T can not reduce too much computations at each iteration due to the large 
dynamic range of the messages. This motivated us to propose the /x-EMS algorithm, which is 
similar to the T-EMS algorithm but with a dynamic and message-matched threshold. 

Example 2: Consider an EPR4 channel with characteristic polynomial f(D) = 1 + D — 
D 2 - D 3 . The simulated code is a 16-ary NB-LDPC code Ci 6 [225, 173] of rate 0.77, which is 
constructed by the properties of finite fields [11]. The corresponding parity-check matrix has 
row weight 14 and column weights 3 and 4. The squared Euclidean distances in Algorithm 1 
are quantized with p = 9 and d max = 180. All the algorithms are carried out with maximum 
iteration C = 50. The parameters for simulation are listed in the following: 

1) for the /i-EMS algorithm, p calculated by (7) with c = 0; for the D-EMS algorithm, 

D a = 45, D b = 35; for the T-EMS algorithm, T s = 30, T b = 10; for M -EMS algorithm, 

M = 10; 
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Fig. 5: Complexity ratios of different detection/decoding algorithms for decoding the 
C 32 [961,765] coded dicode channel. 



2) the scaling factors of the M-EMS algorithm, the T-EMS algorithm, the D-EMS algorithm 
and the /i-EMS algorithm are 0.6, 0.6, 0.6 and 0.75, respectively. 
The simulation results are shown in the Fig. 6. It can be seen that at BER=10 5 

a) the max-log-MAP-B-X-EMS algorithms perform as well as the BCJR^QSPA; 

b) the max-log-MAP-^X-EMS algorithms perform as well as the BCJR-^QSPA; 

c) the max-log-MAP^X-EMS algorithms perform about 0.4 dB better than BCJR^QSPA. 
The complexity ratios of different detection/decoding algorithms are shown in Fig. 7. It can be 
seen that, at BER=10 5 , the max-log-MAPoD-EMS algorithm is the simplest one with com- 
plexity ratio about 0.5, the max-log-MAPo/i-EMS algorithm has a complexity with complexity 
ratio about 0.55. 

Example 3: Consider the Proakis. (b) channel [13, Sec 9.4-3] with characteristic polynomial 
f(D) = 0.407+0.815D + 0.407D 2 . The simulated code is a 16-ary NB-LDPC code C 16 [225, 173] 
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Fig. 6: Error performances of different detection/decoding algorithms for decoding the 
Ci 6 [225, 173] coded EPR4 channel. 



of rate 0.77, which is constructed by the properties of finite fields. The corresponding parity- 
check matrix has row weight 14 and column weights 3 and 4. The squared Euclidean distances 
in Algorithm 1 are quantized with p = 9 and d max = 60. All the algorithms are carried out 
with maximum iteration C = 50. The parameters for simulation are listed in the following: 

1) for the yU-EMS algorithm, fi calculated by (7) with c = 0; for the D-EMS algorithm, 
D s = 45, D b = 35; for the T-EMS algorithm, T s = 10, T b = 5; for M-EMS algorithm, 
M = 10; 

2) the scaling factors of the M-EMS algorithm, the T-EMS algorithm, the D-EMS algorithm 
and the ytx-EMS algorithm are 0.7, 0.6, 0.6 and 0.75, respectively. 

The simulation results are shown in the the Fig. 8. It can be seen that at BER=10~ 5 
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Bit Error Rate 



Fig. 7: Complexity ratios of different detection/decoding algorithms for decoding the 
Ci 6 [225, 173] coded EPR4 channel. 

a) the max-log-MAP<H-X-EMS algorithms perform as well as the BCJR^QSPA; 

b) the max-log-MAP->X-EMS algorithms perform as well as the BCJR-)>QSPA; 

c) the max-log-MAP^X-EMS algorithms perform about 0.3 dB better than BCJR-^QSPA. 
The complexity ratios of different detection/decoding algorithms are shown in Fig. 9. It can 

be seen that, at BER=1CT 5 , the max-log-MAP^H-yU-EMS algorithm is the simplest one with 
complexity ratio about 0.5. 

Remark: From the preceding examples, it can be seen that the complexity ratio of max-log- 
MAP<c->/i-EMS algorithm is always around 0.5. 
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Fig. 8: Error performances of different detection/decoding algorithms for decoding the 
Ci 6 [225, 173] coded Proakis. (b) channel. 



VI. Conclusion 

In this paper, we have proposed two low-complexity joint iterative detection/decoding algo- 
rithms for NB-LDPC coded ISI channels. The proposed algorithms work iteratively by exchang- 
ing either soft or hard messages between the detectors and the decoders. We have also presented 
a low-complexity decoding algorithm NB-LDPC codes. Simulation results show that the max- 
log-MAP^X-EMS algorithm performs as well as BCJR^QSPA, and the Viterbi^GMLGD 
algorithm, which is the simplest one, suffers from a performance degradation within one dB 
compared with BCJR^QSPA. These algorithms provide good candidates for trade-offs between 
performance and complexity. 
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Fig. 9: Complexity ratios of different detection/decoding algorithms for decoding the 
Ci 6 [225, 173] coded Proakis. (b) channel. 



Appendix 

A rough comparison between binary and NB-LDPC codes coded ISI channel is conducted 
in this appendix. We have simulated a binary LDPC code C 2 [495, 433] [42] and a 16-ary NB- 
LDPC code C 16 [124, 107]. These two codes have almost the same bit lengths and code rates. 
The nonbinary code C 16 [124, 107] is constructed by the PEG algorithm [10] with column wight 
2. We have simulated these two codes over EPR4 channels. The simulation results are shown 
in Fig. 10. It can be seen that & 6 [124, 107] performs about 0.6 dB better than C 2 [495, 433]. 
We have also simulated these two codes over AWGN channels. The simulation results are also 
given in Fig. 10. It can be seen that Ci 6 [124, 107] performs only 0.2 dB better than C 2 [495, 433] 
as apposed to 0.6 dB. We conclude that NB-LDPC codes may be more suitable to combat 
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E b /N (dB) 



Fig. 10: Error performances of different algorithms for decoding the Ci 6 [124, 107] and binary 
LDPC code (62,495) coded AWGN channel and the EPR4 channel. 



inter- symbol interferences. 
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